The SuperARV Language Model: Investigating the Effectiveness of Tightly Integrating Multiple Knowledge Sources

Authors

  • Wen Wang
  • Mary P. Harper
Abstract

A new almost-parsing language model incorporating multiple knowledge sources that is based upon the concept of Constraint Dependency Grammars is presented in this paper. Lexical features and syntactic constraints are tightly integrated into a uniform linguistic structure called a SuperARV that is associated with a word in the lexicon. The SuperARV language model reduces perplexity and word error rate compared to trigram, part-of-speech-based, and parser-based language models. The relative contributions of the various knowledge sources to the strength of our model are also investigated by using constraint relaxation at the level of the knowledge sources. We have found that although each knowledge source contributes to language model quality, lexical features are an outstanding contributor when they are tightly integrated with word identity and syntactic constraints. Our investigation also suggests possible reasons for the reported poor performance of several probabilistic dependency grammar models in the literature.

Similar resources

The robustness of an almost-parsing language model given errorful training data

An almost-parsing language model has been developed [1] that provides a framework for tightly integrating multiple knowledge sources. Lexical features and syntactic constraints are integrated into a uniform linguistic structure (called a SuperARV) that is associated with words in the lexicon. The SuperARV language model has been found able to reduce perplexity and word error rate (WER) compared...

Using Multiple-Variable Matching to Identify EFL Ecological Sources of Differential Item Functioning

Context is a vague notion with numerous building blocks making language test scores inferences quite convoluted. This study has made use of a model of item responding that has striven to theorize the contextual infrastructure of differential item functioning (DIF) research and help specify the sources of DIF. Two steps were taken in this research: first, to identify DIF by gender grouping via l...

Investigating the Position of Reason in the Method of Mystical and Philosophical Knowledge from the Perspective of Rumi and Ibn Sina

Reason is one of the sources of knowledge in human existence and human forces that distinguishes between good and evil or right and wrong. This has led many scientists and mystics to reflect on this issue among whom are Ibn Sina and Rumi. The purpose of this study is to study the nature of reason, its relationship with love and religion, factors and obstacles to reaching the perfection of ...

On Teaching to Diversity: Investigating the Effectiveness of MI-Inspired Instruction in an EFL Context

This study reports an experiment conducted to investigate the effectiveness of implementing MI-inspired instruction in an EFL context. To this end, a group of ten intermediate female students took part in a quasi-experimental study. At the beginning of the experiment, Multiple Intelligences Survey (Armstrong, 1993) was administered to determine the participants’ MI profiles. The participants we...

Investigating the relationship among complexity, range, and strength of grammatical knowledge of EFL students

Assessment of grammatical knowledge is a rather neglected area of research in the field with many open questions (Purpura, 2004). The present research incorporates recent proposals about the nature of grammatical development to create a framework consisting of dimensions of complexity, range and strength, and studies which dimension(s) can best predict the stat...

Publication date: 2002